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Abstract: Genomic structural changes, such as gene Copy Number Variations (CNVs) are extremely abundant in the hu- 
man genome. An enormous effort is currently ongoing to recognize and catalogue human CNVs and their associations 
with abnormal phenotypic outcomes. Recently, several reports related neuropsychiatric diseases (i.e. autism spectrum dis- 
orders, schizophrenia, mental retardation, behavioral problems, epilepsy) with specific CNV Moreover, for some condi- 
tions, both the deletion and dupUcation of the same genomic segment are related to the phenotype. Syndromes associated 
with CNVs (microdeletion and microduplication) have long been known to display specific neurobehavioral traits. It is 
important to note that not every gene is susceptible to gene dosage changes and there are only a few dosage sensitive 
genes. Smith-Magenis (SMS) and Potocki-Lupski (PTLS) syndromes are associated with a reciprocal microdeletion and 
microdupUcation within chromosome 17pll.2. in humans. The dosage sensitive gene responsible for most phenotypes in 
SMS has been identified: the Retinoic Acid Induced 1 (RAIl). Studies on mouse models and humans suggest that RAIl is 
hkely the dosage sensitive gene responsible for clinical features in PTLS. In addition, the human RAIl gene has been im- 
plicated in several neurobehavioral traits as spinocerebellar ataxia (SCA2), schizophrenia and non syndromic autism. In 
this review we discuss the evidence of RAIl as a dosage sensitive gene, its relationship with different neurobehavioral 
traits, gene structure and mutations, and what is known about its molecular and cellular function, as a first step in the elu- 
cidation of the mechanisms that relate dosage sensitive genes with abnormal neurobehavioral outcomes. 
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GENOMIC STRUCTURAL CHANGES AND NEURO- 
PSYCHIATRIC TRAITS 

Genomic structural changes, such as gene Copy Number 
Variations (CNVs) are extremely abundant in the human 
genome. Approximately 14,400 CNV loci larger than 1 kb 
were identified to date (http://projects.tcag.ca/variation/) [1]. 
In a recent article the impressive amount of more than 1,000 
CNV/individual human genome ranging from 443 bp to 1.28 
megabases (Mb), with a median size of 2.9 kb was reported 
[2]. On the other hand, an enormous effort is currently ongo- 
ing to recognize and catalogue human CNVs associated with 
abnormal phenotypic outcomes [3-7]. Some conditions af- 
fecting the central nervous system, such as schizophrenia, 
mental retardation (MR) and autism spectrum disorder 
(ASD) were shown recently to be associated with both the 
deletion and the duplication of the same genomic segment 
[8-13]. There is a great level of complexity between the 
presence of CNV and the resulting phenotypes that is not 
only a direct consequence of specific altered gene dosage. 
Analyses of the genome-wide functional impact of these 
structural variants showed that copy number changes not 
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only cause alterations in expression levels of genes within 
CNVs but also influence the expression of genes in their 
vicinity [14, 15]. Moreover, a recent study clearly demon- 
strates that the presence of structural changes associated with 
CNV is enough to cause a phenotype independent of gene 
dosage [16]. 

Genomic disorders are the clinical manifestation of 
pathological CNV, they are frequent conditions (~ 1 per 
1,000 births) and often sporadic resulting from de novo rear- 
rangements [17]. In a subset of such conditions the rear- 
rangements comprise multiple unrelated contiguous genes 
that are physically linked and thus have been referred to as 
Contiguous Gene Syndromes (CGS). An increasing number 
of CGS are being described each of them presenting a com- 
plex and specific phenotype. The beauty of these CGS is that 
there are specific phenotypes associated with certain ge- 
nomic regions, thus making the detection of the genes in- 
volved in certain pathways easy. Although the segmental 
aneuploidy usually involves several genes, only a small sub- 
set of them conveys phenotypes as a function of copy num- 
ber alteration. These particular genes are referred as "dosage 
sensitive genes". However, the question of what makes a 
gene or its product a "dosage sensitive gene" remains to be 
elucidated. 

Smith-Magenis (SMS, OMIM# 182290) and Potocki- 
Lupski (PTLS, OMIM# 610883) syndromes are two exam- 
ples of CGS that are associated with a microdeletion and a 
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microduplication respectively within chromosome 17 band 
pi 1.2. SMS was first described in 1986, and has a birth 
prevalence estimated at 1/15,000-1/25,000 [18-20]. The 
clinical phenotype includes craniofacial abnormalities, 
brachydactyly, self-injurious behavior, sleep abnormalities 
and mental retardation. Less commonly reported is cleft pal- 
ate, congenital heart defects, seizures, hearing impairment 
and urinary tract anomalies. Molecular studies revealed a 
common deleted region of ~4 Mb in the majority of SMS 
patients (>70-80%) [21]. Unusual sized deletions (smaller or 
larger) were observed in 20-25% of patients. By examining 
the breakpoints in unusual sized deletions, the SMS critical 
region was redefined to a -950 kb interval, in which fifteen 
genes and eight predicted genes were present [22] (Table 1). 
PTLS was first described in 2000 [23] and is associated with 
the reciprocal duplication of the genomic interval deleted in 
SMS. The clinical presentation of PTLS includes mental 
retardation, autistic features, hyperactivity, developmental 
delay and dysmorphic features. In addition, PTLS patients 
present hypotonia, poor feeding and difficulty to thrive in 
infancy, oral-pharyngeal dysphasia, obstructive and central 
sleep apnea, structural cardiovascular abnormalities, electro- 
encephalogram (EEG) abnormalities, behavioral abnormali- 
ties, and hypermetropia [24-26] (Table 1). 

RAIl IS THE DOSAGE SENSITIVE GENE MOSTLY 
RESPONSIBLE FOR THE SMS AND PTLS CLINICAL 
PRESENTATION 

As mentioned above, only a few genes do appear to be 
clearly haploinsufficient, such that hemizygotes exhibit a 
recognizable phenotype. Point mutations (nonsense and 



frameshift as well as missense alleles) in the Retinoic Acid 
Induced 1 gene (RAIl) were identified in patients with clini- 
cal presentation of SMS but no molecular deletion found by 
FISH [30-33] (Tables 1 and 2), suggesting that RAIl is the 
dosage sensitive gene causative of SMS. In specific, there 
are two cases of patients with SMS that carry missense muta- 
tions that alter amino acids within a highly conserved region 
of the gene (A4685G and G5423A). Both patients have a 
wide spectrum of neurobehavioral phenotypes, including 
sleep disturbance, self-injurious behaviors, mental retarda- 
tion and developmental delay. This supports the fact that 
RAIl is a protein directly involved in these specific neuro- 
behavioral phenotypes, since the point mutations mentioned 
above generate a full length protein with only the change of 
one single amino acid, with stability similar to the wild type 
protein [34]. 

In addition, a recent report showed a patient with PTLS 
phenotype having a duphcation of a very small region that 
only contains the RAIl gene [29]. This is consistent with the 
notion of RAIl as the dosage sensitive gene within this ge- 
nomic region (Table 1). 

Additional support to the idea that RAIl is the dosage 
sensitive gene within this interval comes from studies in 
mouse models. Human chromosome 17pll.2 is syntenic to 
the 32-34 cM region of murine chromosome 11. The number 
and order of the genes are highly conserved [36-38]. By 
chromosome engineering, a strategy developed to introduce 
defined chromosomal rearrangements into the mouse ge- 
nome to delineate gene haploinsufficiency effects (haploid 
genetics) [39-43], a deletion and reciprocal duplication in the 
mouse chromosome region syntenic to the SMS critical in- 



Table 1. Key Neurobehavioral Features Found in SMS and PTLS Patients 



Neurobehavioral features 


SMS 


PTLS 


% in common deletion 


% in RAIl mutations 


% in common duphcation 


% in small duplications 


Variable mental retardation or 
cognitive impairment 


100 


100 


100 


100 


Motor and speech delay 


>90 


70 


100 


100 


Sleep disturbance 


90 


100 


66.6 


75 


Infantile hypotonia 


>90 


61 


88.8 


75 


Overt seizures 


11-30 


16.6 


NR 


NR 


EEG abnormalities 


49 


NR 


66.6-70 


5.8 


Attention seeking 


80-100 


100 


NR 


NR 


Onychotillomania 


25-85 


100 


NR 


NR 


Polyembolokoilamania 


25-85 


80 


NR 


NR 


Hand biting/self-biting 


80 


60 


NR 


NR 


Self-injurious behaviors 


70-90 


100 


NR 


NR 


Self-hugging/hand wringing 


50-80 


100 


NR 


NR 


Head banging/face slapping 


70 


90 


NR 


NR 


Self-picking 


54 


100 


NR 


NR 


Hyperactive 


80 


100 


>70 


66.6 


Autistic features 


NR 


NR 


87.5 


66.6 



A comparison between the phenotypes present in patients with the genomic rearrangements and point mutations of RAIl gene is shown. For SMS, the clinical features are presented 
in the percentage of patients carrying either the common 17pll .2 deletion (-3.7 Mb) or a mutation in RAIl gene. The percentages of features in PTLS are represented for patients 
that harbor the common 17pll.2 duplication (-3.7 Mb) or small duplications (<1 Mb). (NR: not reported) [25, 27-29]. 
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Mutation 


Exon 


Resultant protein 


Mental 
retardation 


dieep 
disturbance 


Speech 
delay 


Motor 


Self 
injurious 


Develop- 
mental delay 


Hypotonia 


''253dell9 


3 


Missincorporation of 60 
aa 


+ 


+ 


+ 


+ 


+ 


NR 


+ 


L L L 7UdV_, 


3 


Missincorporation of 65 
aa 


-{. 


-{. 


NR 


NR 


-{. 


NR 


NR 




3 


Missincorporation of 34 

aa 


^. 


^. 


NR 


^. 


^. 


NR 


^. 




3 


Missincorporation of 8 
aa 


-{. 


-{. 


-{. 


NR 


-{. 


NR 


NR 


"C2878T 


3 


R960X 


+ 


+ 


NR 


NR 


+ 


NR 


NR 


"SlOSinsC 


3 


Missincorporation of 30 
aa 


+ 


+ 


NR 


NR 


+ 


NR 


NR 




3 


Missincorporation of 28 

aa 


-{. 


-{. 




NR 




NR 


NR 


v_y 1 vivxv^ 


3 


Missincorporation of 46 
aa 


-1- 


-1- 


NR 


-1- 


-{- 


NR 


NR 




3 


Missincorporation of 36 
aa 


-{. 


-{. 


NR 


NR 


-{. 


NR 


NR 


'^A4685G 


3 


Q1562R 


+ 


+ 


+ 


+ 


+ 


NR 


+ 


''4933delGCCG 


3 


Missincorporation of 35 
aa 


+ 


+ 


NR 


NR 


+ 


+ 


NR 


="'5265delC 


3 


Missincorporation of 74 
aa 


+ 


+ 


NR 


NR 


+ 


NR 


NR 


'^G5423A 


3 


S1808N 


+ 


+ 


+ 


NR 


+ 


NR 


+ 


''A3634G 
''(PolyQ),8 


3 
3 


S1212G 
(PolyQ),8 


+ 


+ 


+ 




+ 


+ 


+ 



The mutations found in SMS patients are listed with their position in the cDNA, the protein that they generate and the neurobehavioral phenotypes associated with each mutation (+ 
means the presence of the characteristic, NR: not reported) [30-33, 35]. 



terval were generated [44]. The mouse model for SMS, 
Df( 11)17/+ or Del, carries a -1.7 Mb deletion and presents 
craniofacial abnormalities, seizures, obesity, alterations in 
circadian rhythm and hypoactivity, decreased anxiety in the 
plus maze test, social interaction differences in the tube test 
and the social novelty test recapitulating most of the pheno- 
types observed in human patients [15, 44-47]. The mouse 
model for PTLS syndrome, Dp(ll)17/+ or Dup, carries a 
duphcation of a region of -1.7 Mb and are significantly 
leaner than wild type mice, hyperactive and exhibit impaired 
contextual fear conditioning, they display increased anxiety 
behavior in the plus maze test and increased dominance in 
the tube test. Abnormal sociability and preference for social 
novelty was also observed [15, 44, 45]. 

Interestingly enough, most of the phenotypes present in 
Dup mice were rescued in the Dup/RaW compound het- 
erozygote (obtained by mating Dup mice with Rail'^'' mice, a 
model with Rail haploinsufficiency) where the gene dosage 
of Rail was 2n despite the presence of 3 copies of the other 
23 genes in the region; being the first indication of Rail as 
the dosage sensitive gene responsible the PTLS syndrome 
[50]. Consistent with this, there have been reports of mouse 
models with underexpression of Rail [48] and overexpres- 
sion of Rail [51] that presented several of the expected phe- 



notypes. For the last one, aneilyses in both hemizygous and 
homozygous transgenic mice revealed a dosage-dependent 
exacerbation of the phenotype, which included growth retar- 
dation, increased anxiety and hyperactivity [51]. A summary 
can be seen in Table 3. 

Utilizing the Del and Dup mouse models, the levels of 
expression of Rail and the other genes within this specific 
genomic rearrangement were studied. Genome-wide expres- 
sion levels in four organs (cerebellum, heart, kidney and 
testis) from adult male mice (at least three animals of Dup, 
Del, Dup/Del and wild type mice) showed a positive correla- 
tion between gene dosage and expression levels [15, 16]. 
The transcripts within the rearranged interval are expressed 
on average 0.66 (SD=0.15) fold less in the Del (one copy) 
and 1.38 (SD=0.29) fold more in Dup animals (three copies). 
They were, however, unchanged in Del/Dup mice (1.02 
(SD=0.16) fold more, two copies in cis) compared to normal 
controls (two copies in trans) [16]. This clearly showed that 
Rail gene copy number is correlated with levels of expres- 
sion of the gene, and suggests that the protein levels are 
modified concomitant with gene dosage. Interestingly 
enough, most of the genes within this genomic interval be- 
have in the same way, suggesting that is not the levels of 
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Mouse model 


Del 


Rail^- 


Dup 


Dup/RaiV 


Tg Rail 


Overt seizures 


Present (20%) 


Subtle 


No present 


No present 


No present 


EEG 


Abnormal 


Abnormal 


Normal 


ND 


ND 


Locomotor activity 


Hypoactive 


Abnormal 


Hyperactive 


Hyperactive 


Hyperactive 


Anxiety 


Decreased 


Abnormal 


Increased 


Normal 


Increased 




IN Ullllai 


IN Ulllloi 


mipoiicu 


IN Ulllldi 


ND 


Social novelty recognition 


Abnormal 


ND 


Impaired 


ND 


Abnormal 


Dominance like behavior 


Decreased 


ND 


Increased 


ND 


Increased 


Reference 


[16, 44, 45] 


[48-50] 


[15,16,44, 45, 50] 


[50] 


[51,52] 



Summary of the neurobehavioral phenotypes observed in existing mouse models for SMS and PTLS. Del and Rail*'' mice carry one gene copy of Rail, Dup model carries three 
copies, Tg Ruil has several copies and Dup/RaiV has two copies or Rail gene. (N/D: not determined). 



expression that make the difference between Rail and the 
other 23 genes in the region. 

IVIOLECULAR ASPECTS OF RAIl 

RAIl lies in the middle of the Smith-Ivlagenis critical 
region, spanning over 120 kb. It consists of six exons, being 
the third exon the one that contains more than 90% of the 
coding region. 

As is shown in Fig. (1), there are several regions recog- 
nizable in RAIl protein structure. A polymorphic CAG re- 
peat encoding for a polyglutamine tract is present in the N- 
terminus of the RAIl protein. Two poly serine tracts, each 
located at the N and C-terminal regions. The polyserine sig- 
nal is similar to the one present in the DRPLA gene and the 
Drosophila hairless gene, both proteins involved in neuronal 
development [53, 54]. RAIl has two putative bipartite nu- 
clear localization signals (NLSs) in silico predicted. It also 
possesses a zinc finger like plant homeo domain (PHD) in 
the C-terminus, which is present in the trithorax family of 
chromatin remodeling transcriptional regulators [30, 31]. 
The isoform 1 of RAIl corresponds to the cemonical se- 
quence, and it is found in high levels mainly in brain and 
heart [55]. There are other three isoforms produced by alter- 
native splicing which are described in the online database 
Universal Protein Resource (www.UniProt.org). These iso- 
forms differ from the canonical one by the change or lack of 
several residues; the schematic representations of these struc- 
tures are represented in Fig. (1). Isoform 2 is the most simi- 
lar to the canonical, with a length of 1862 amino acids and 
domain conservation [53]. In the isoform 3 (1640 amino ac- 
ids) the second polyserine tract and PHD domain are missing 
[56]. Isoform 4 is the shortest, containing only the first half 
of RAIl protein (966 amino acids) [57] not encompassing 
NLSs and being comparable to the truncated protein found in 
some SMS patients. This short isoform has been found in 
muscle, while the other proteins have been described from 
brain tissue. In addition to the isoforms 2, 3 and 4 of RAIl, 



the online database of eukaryotic genomes, Ensembl project 
(www.ensembl.org) reports other two transcripts 
(ENST00000395776 and ENST00000428973), also gener- 
ated by alternative splicing and both produce the same RAIl 
protein of 1993 amino acids (ENSP00000379122). This pro- 
tein shares a 100% homology with the isoform 1 until the 
residue 1855. It is important to note that in the same data- 
base, several transcripts of RAIl were found, and these are 
consistent with the records of the other variants reported in 
the databases of UniProt and UCSC Genome Browser 
(http://genome.ucsc.edu/). 

Besides the domains described above for the protein, 
there are several post-translational modifications (PTM) sites 
found in RAIl, both in silico and experimentally. Some ex- 
perimental evidence of PTM present in the RAIl protein 
comes from in vitro studies where the isoform 1 and five 
RAIl mutated forms of the protein {RAIl -HA 2687delC, 
RAIl -HA 3103delC, RAIl R960X, RAIl -HA Q1562R, and 
RAIl -HA S1808N, four of them associated with SMS clini- 
cal phenotype) were studied. In this report all proteins pre- 
sented a higher molecular weight than expected when run in 
an SDS-PAGE gel analysis. Moreover, the PTM were be 
mapped to the N-terminal of RAIl [34]. Through the analy- 
sis of RAIl sequence with NetGlycate 1.0, NetOGlyc 3.1 
and YinOYang 1.2 servers [58-60], we found several pre- 
dicted glycation sites homogcnously located within the pro- 
tein sequence and more interestingly, there are several O- 
glycosylation sites with high score, most of them mapping in 
the first amino-terminal half of RAIL There are also O- 
linked p-N-acetylglucosamine (0-GlcNac) predicted sites 
with high score, which are present in many transcription fac- 
tors and neuronal proteins; this modification participates in 
transcription, translation, proteasomal degradation and apop- 
tosis among others and is directly involved in the regulation 
of neuronal development and synaptic transmission [61]. 

Phosphorylation is a well established PTM, implicated in 
protein regulation in a variety of cellular processes. We ran 
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Putative regulator or 
repressor of 
TransacBvatlonal AcMvl^ 




RAIIiMfonnI 



RAI1iwlbnn2 



RAI1 isoform 3 



RMlii0li9nn4 



RAI1 EN8P00000379122 



• PhcsphorytaUonsMBS 

^ Ace^ialionsite 

A Putative SUMOylatkHi site 



Fig. (1). Structure of RAIL By in silico analyses, several domains have been found for RATI : a polyglutamine tract at the N- terminal of the 

protein, two polyserine domains, a PHD domain at the C-terminal of RAll and two putative nuclear localization signals (NLS). RAIl protein 
structure is schematically represented, and the domains described are showed with their localization in the amino acidic sequence. By the 
evaluation of the wild type and mutant proteins associated to SMS, two main domains were found: the N-terminal half of RAIl is responsible 
for the transactivational activity and C-terminal half beginning in residue 1038 is responsible for nuclear localization [34]. The RAIl PTMs 
found are depicted with black figures while the putative sites are represented by white figures. The phosphorylation sites (IThr and 13Ser) 
found in HeLa cells are represented with black circles. The reported acetylation site is depicted with a black diamond. The putative SUMOy- 
lation site with the highest score is represented with a white triangle. 

Below, the other four isoforms described for human RAIl are represented. These isoforms differ from the canonical one by the change of 

several residues (depicted in slanted rectangles) and also by some fragments missed (depicted by discontinued lines). Isoform 2 is the most 
similar to the canonical, while in isoform 3 are missing the second polyserine tract and PHD domain. Isoform 4 is the shortest containing only 
the first half of RAH protein, not encompassing NLSs and being comparable to the truncated proteins found in some SMS patients. The long- 
est variant of RAH (ENSP00000379122) is a protein reported by the online database of eukaryotic genomes, Ensembl, which has a length of 
1993 amino acids and has complete homology with the isoform 1 until amino acid 1855. 



an in silico analysis of putative phosphorylation sites for 
RAIl (NetPhos 2.0 Server [62]), and found the number of 
putative phosphorylable amino acid/total as follows: 
Serl28/217, Thr36/99, and Tyrl3/39, indicating the exis- 
tence of several possible phosphorylation sites. Consistent 
with these predictions, in recent studies to assess the global 
in vivo phosphoproteome one threonine (Thrll86) and thir- 
teen serines (Ser568, 571,574, 687, 1107, 1108, 1110, 1121, 
1122, 1352, 1358, 1374 and 1616) were identified in a phos- 
phorylated state in the RAIl protein, and three of them 



(Ser568, Ser571 and Ser574) were regulated upon EGF 
stimulation in HeLa cells [63-66]. Interestingly, most of 
these sites eire consensus for CKl, GSK3, NEK6 and ERK 

kinases. 

Lysine acetylation is a reversible PTM, which neutralizes 
the positive charge of this amino acid, changing protein 
function. This PTM is very important in the regulation of 
gene expression through the modification of histone proteins. 
Site-specific acetylation of a growing niunber of non-histone 
proteins has been shown to regulate their activity, localiza- 
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tion, specific interactions as well as stability/degradation, 
therefore controlling a variety of cellular processes, such as 
transcription, proliferation, apoptosis and differentiation 
[67]. Interestingly, in another large scale analysis [68] an 
acetylation modification was reported for RAIl at the pep- 
tide KGLEQGGKASDGISK in the Lysine 774 (underlined), 
which also presented a high accessibihty rate in the 3D struc- 
ture. 

SUMOylation (Small Ubiquitin-like Modifier proteins) is 

a post-translational modification involved in various cellular 
processes, such as nuclear-cytosolic transport, transcriptional 
regulation, apoptosis, protein stability, response to stress, 
and progression through the cell cycle [69, 70]. In silico 
analysis (SUMOsp 2.0 [71]) of the RAIl protein sequence 
gave seven putative SUMOylation sites of the Typel: 'P-K- 
X-E, in Lys549, 811, 819, 988, 1328, 1425 and 1500 (being 
Lys819 the most probable one). Therefore, the prediction of 
several sites for PTM within RAIl sequence emd the differ- 
ence between the obtained vs. expected molecular weight of 
the protein, are consistent with a PTM in the N-terminal do- 
main of RAIl (Fig. (1)). 

RAIl MOLECULAR AND CELLULAR FUNCTION 

Little is known about the cellular and developmental role 
of RAIL The first study was made in a mouse embryonic 
carcinoma cell line (P19) where the expression of GTl, a 
splice variant of Rail was markedly up-regulated after the 
treatment with retinoic acid for inducing neuronal differen- 
tiation [72]. 

Initially it was suggested, based in bioinformatic analyses 
that RAIl could function as a transcriptional regulator [30, 
31]. RAIl and TCF20, a transcriptional cofactor, share a 
homology of 50% at the amino acid level, and both genes 
have a similar structure. Furthermore, the presence of the 
polyglutamine stretch at the N-terminal of RAIl is coinci- 
dent with many transcription factors containing polyglu- 
tamine activation domains and it has been shown that these 
stretches can modulate the transcription activation [73]. Con- 
sistent with this speculation, it was reported that mouse Rail 
has transactivational activity in HeLa cells but in moderate 
levels [48]. Considering it had milder activity than classical 
transcription factors, this can indicate that Rail might not act 
alone but may be part of a complex that regulates transcrip- 
tion. In 2010, an independent study showed that mouse Rail 
had much stronger transactivation activity in Neuro-2a cells 
(a neuronal derived cell line) than in HeLa cells [34]. These 
results suggest specificity in the transcription machinery that 
RAIl could be part of and further indicate the importance of 
its role in neuronal derived tissues, since RAIl is a dosage 
sensitive gene involved in neurobehavioral phenotypes. 
Moreover, it was also demonstrated that human RAIl has 
tremscription factor activity [34] and is able to localize to the 
nucleus. Intriguingly, the two missense mutations (RAIl 
Q1562R and RAIl S1808N) were shown to locaUze to the 
nucleus and activate transcription of a reporter gene in the 
same magnitude than the wild type protein, strongly suggest- 
ing that these mutations arc involved in other functions yet 
unknown for RAIl. The analysis of the mutations RAIl-HA 
2687delC, RAIl-HA 3103delC and RAIl R960X showed 
that they generated a truncated polypeptide with cytoplas- 



matic localization and higher transcription factor activity 
than the wild type protein. In addition, two different C- 
terminal halves of RAIl protein (1038 aa-end and 1229 aa- 
end) were able to localize into the nucleus but had no trans- 
activation activity. Taking all this into consideration plus the 
mouse protein studies previously reported [48], two func- 
tional domains were assigned to each half of RAIl: the N- 
terminal that contains transactivational activity and the C- 
terminal half of RAIl responsible for the correct subcellular 
localization of the protein. In a normal situation the two 
functional domains of the protein are together. This suggests 
that the pathogenic outcome of some of the mutated forms 
may be explained by the dissociation of these two domains. 
However, in this context it is intriguing, and needs further 
study, the contribution of the different RAIl isoforms to the 
final phenotype. 

Therefore, it is clear that RAIl is regulating gene expres- 
sion and there are new data from cells and mice regarding to 
this aspect of the protein. A genome-wide gene expression 
study made in HEK293T (human embryonic kidney cells) 
with a 50% knockdown of RAIl generated by siRNA was 
performed using microarrays [74]. The top dysregulated 
genes found in these RAIl haploinsufficient cells are in- 
volved in neuronal differentiation, circadian activity and 
behavior; besides growth signaling, insulin sensitivity, gene 
expression, cell-cycle regulation, lipid biosynthesis, fat mo- 
bilization and skeletal, cardiovascular and renal develop- 
ment. The top candidate genes found upregulated due to 
RAIl haploinsufficiency are ZICl, PSEN2, RXRB, CLN8 and 
SCN12A being involved in neurogenesis, mental retardation, 
sensory transmission, nociceptive behavior, epilepsy and 
neurological function. Amongst the top candidate genes that 
are downregulated there are NFl, MLL and NR1D2, which 
are associated to schizophrenia, neurofibromatosis type I and 
circadian activity [74]. 

Recently in the mouse model for Rail haploinsuffi- 
ciency. Rail*'', several genes were reported to be dysregu- 
lated in the hypothalamus [75]. These genes include Slc6al2 
and Slc38a5, which are related to traits like epilepsy, anxiety 
disorder, mental retardation, whose expression is downregu- 
lated as a consequence of the haploinsufficiency of Rail. 
Furthermore, it was found that human RAIl isoform 1 di- 
rectly regulates the expression of BDNF (brain-derived neu- 
rotrophic factor) [75] which is a neurotrophin that regulates 
neuronal survival, differentiation and growth during brain 
development, among others. The expression of this gene is 
known to be reduced in patients with Alzheimer, Parkinson 
and Huntington diseases [76] . Additionally, BNDF has been 
reported to be associated with obesity and hyperphagia [77, 
78]. 

Moreover, altered gene expression was found for many 
genes in a hippocampus expression profile of the Dup mouse 
model for PTLS [15]. In this study the transcript of Ataxin 2, 
a protein mutated in spinocerebellar ataxia type 2 (SCA2), 
had a fold change of 1.24 by microarray analysis; certainly 
suggesting a hnk between its expression and the overexpres- 
sion of Rail. 

Altogether, the studies showing that CNV of RAIl causes 
dysregulation of many genes involved in many different cel- 
lular processes, suggest that RAIl functions through specific 
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pathways involved in the regulation of various biological 
processes, and when disrupted result in the phenotypic ef- 
fects observed in SMS, PTLS and other neurobehavioral 
traits. 

RAIl HAS BEEN IMPLICATED IN SEVERAL NEU- 
ROBEHAVIORAL TRAITS 

As mentioned before, the haploinsufficiency of RAIl 
causes SMS meanwhile the duplication of a small region 
containing this gene is able to cause PTLS. Besides the 
strong association of RAIl with these two genomic disor- 
ders, this protein has been reported to be related with other 
neiu-obehavioral traits. 

At the N-terminus of the RAIl protein there is a poly- 
morphic CAG repeat, which length is associated with the age 
of onset of spinocerebellar ataxia type 2 [79] and the re- 
sponse to neuroleptic medication in schizophrenia [80]. In 
addition, RAIl has been shown to be associated with non 
syndromic autism [81]. 

Spinocerebellar ataxia type 2 (SCA2) is an autosomal 
dominant disease caused by the increment of a CAG repeat 



tract in ataxin-2 gene, generating an expanded polyglutamine 
tract in this protein. A study of the association between CAG 
repeat length and age at onset variabiUty in SCA2 [79] in 

more than 100 patients found that RAIl locus contributes to 
4.1% of the variance in SCA2 age at onset, after the effect 
caused by the expanded ataxin-2 protein. Therefore, this is 
the only association reporting RAIl as a contributor in a neu- 
rodegenerative disorder, and due to a larger polyglutamine 
tract in RAIl protein, which ranges from 10 to 16 glutamines 
in the normal population [33]. 

It was reported that a CAG repeat polymorphism in RAIl 
is associated with both the severity of the phenotype and the 
response to medication in schizophrenic patients [80]. Com- 
pared to non-responders, schizophrenic subjects responding 
to neuroleptic medication are characterized by a later age of 
onset and a better long-term outcome. The group responding 
to medication carried a significantly shorter CAG repeat size 
compared to control groups. 

It is remarkable to mention that CAG repeat polymor- 
phism of RAIl have not been found to be causative of the 
SMS phenotype; and the length of this CAG repeat has not 
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Fig. (2). RAIl clianges associated with neurobehavioral phenotypes. The traits that have been found related to RAIl CNVs, mutations or 
sequence changes are summarized. There are phenotypes that are associated with only one copy of RAIl gene (SMS). Many of these pheno- 
types are also related to carrying two copies of the gene, but one allele with mutations (SMS caused by frameshifts, nonsense or missense 
mutations) Harboring two copies of RAIl gene but with polymorphic CAG repeats influence other neurobehavioral traits The alteration 
of RAIl dosage to three copies causes specific phenotypes of PTLS as well as features that overlap with some clinical characteristics of SMS. 
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been analyzed in PTLS patients. The most common polyglu- 
tamine tracts in normal and SMS patients are 13-14 Gin. The 
comparison of phenotypes in SMS deletion patients with 
different number of CAG repeats did not show any associa- 
tion between CAG repeat length and the severity of SMS 
phenotype [33]. Recently, in a study with a group of patients 
with non-complex-autism, RAIl was found to be one of the 
novel autism candidate genes within the autism-associated 
CNVs regions [81]. In this study, RAIl was the top most 
significant gene identified in the analysis of 173 non- 
overlapping CNV regions. 

All of these data indicate that RAIl is participating in 
different aspects of behavior regulation. In Fig. (2) are sum- 
marized the neurobehavioral traits related with RAIl varia- 
tions. In summary, there is plenty of evidence indicating that 
RAIl is the dosage sensitive gene responsible for most of the 
phenotypes of SMS and PTLS syndromes, including several 
neurobehavioral traits. 

It is also possible to suggest that RAIl might have a role 
modulating some aspects of neuron function or differentia- 
tion since in addition to its contribution in both SMS and 
PTLS phenotypic outcome, it is highly expressed in neurons, 
its expression is influenced by retinoic acid and has polyglu- 
tamine polymorphisms that have been found to be associated 
with schizophrenia and spinocerebellar ataxia type 2. Taking 
into consideration the information known to date from stud- 
ies in vitro, it can be hypothesized that at the cellular level 
the cause of the resulting phenotypes related to alterations in 
RAIl can be due to (either or together) an abnormal translo- 
cation of the protein, reduced transactivational activity or 
inability of interaction with other proteins and/or DNA due 
to changes in a specific motif or to modifications in the 3D 
structure. 

However, further studies are necessary with RAIl and 
generally with all the other dosage sensitive genes known up 
to date in order to answer the main question that remains 
unanswered: what makes a gene, or its product, a dosage 

sensitive one? 
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